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ABSTRACT. A random walk on a regular language is a Markov chain on the 
set of all finite words from a finite alphabet A whose transition probabilities 
obey the following rules: (1) Only the last two letters of a word may be 
modified in one jump, and at most one letter may be adjoined or deleted. (2) 
Probabilities of modification, deletion, and/or adjunction depend only on the 
last two letters of the current word. Special cases include (a) reflecting random 
walks on the nonnegative integers; (b) LIFO queues; (c) finite-range random 
walks on homogeneous trees; and (d) random walks on the modular group 
PSL2(Z). It is shown that the n—step transition probabilities of a random 
walk on a regular language must obey one of three different types of power 
laws. The analysis is based on the study of an algebraic system of generating 
functions related to the Green’s function. 


1. Introduction 


The transition probabilities of random walks on the integer lattices, and more 
generally on countable abelian groups, satisfy local limit theorems that are inti- 
mately related to the global limit theorems, such as the CLT, that govern sums of 
i.i.d. random vectors. For random walks on countable nonabelian groups there are 
no such global limit theorems. Nevertheless, it is known that for random walks on 
certain countable nonabelian groups, including the finitely generated free groups 
(20, 10, 14] and the modular group PSL(2, Z) [22], and for certain absolutely con- 
tinuous random walks on semi-simple Lie groups [3, 17] the transition probabilities 
obey local limit theorems; and it is widely suspected that random walks on lattices 
of semi-simple Lie groups obey similar local laws. 

This paper concerns random walks on another class of homogeneous structures, 
the regular languages of computer science [12]. Roughly, a random walk on a regular 
language is a Markov chain on the set of all finite words from a finite alphabet A 
whose transition probabilities obey the following rules: (1) Only the last two letters 
of a word may be modified in one jump, and at most one letter may be adjoined 
or deleted. (2) Probabilities of modification, deletion, and/or adjunction depend 
only on the last two letters of the current word. See section 2.1 for definitions, 
and section 3 for examples. It will be shown that, under suitable irreducibility and 
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aperiodicity hypotheses, the transition probabilities of such random walks must 
follow one of three different types of power laws: as n — 00, 


(1.1) P(Xn = y| Xo = £) ~ Cry R™”, or 
(1.2) ~ CoR nr e, or 
(1.3) ~ CryR ay oP 


for suitable constants Cz, > 0 and R > 1. (Here and throughout the paper 
the symbol ~ indicates that the ratio of the two sides converges to one as n —> 
oo.) Certain countable nonabelian groups, such as (i) the finitely generated free 
groups, (ii) free products of two-element groups, and (iii) the modular group, admit 
descriptions as regular languages, and for random walks on such groups local limit 
theorems will emerge as special cases. In addition, various other Markov chains 
which do not have representations as random walks on groups, such as LIFOs (last- 
in first out queues) with different job types, do have representations as random walks 
on regular languages, and so our results will imply that the transition probabilities 
of such chains must obey one of the three power laws. 

The main tools to be used are Tauberian theorems for generating functions 
(in particular, Darboux’ theorem [2]) and techniques for analyzing singularities 
of generating functions. The key generating functions, the Green’s functions of 
the random walks, will be shown to satisfy recursive relations stemming from the 
recursive nature of a regular language. These recursive relations, in turn, will imply 
that a related finite system of generating functions satisfy a system of polynomial 
equations, and thus are algebraic functions. It will then follow that the singularities 
of the Green’s functions are at a branch point on the Riemann surface determined 
by the algebraic system. Finally, it will be shown that the this branch point has 
ramification two; here the Perron-Frobenius theory of nonnegative matrices will 
play a crucial role. 

The algebraic character of the Green’s functions for random walk on a free group 
was first proved in [1]. This result was subsequently generalized to nearest-neighbor 
random walks on any finitely generated group containing free normal subgroups of 
finite index in [23], and it has been conjectured that these are the only groups 
for which the Green’s functions of nearest-neighbor random walks are algebraic. 
The arguments of papers [1] and [23] give no information about the nature of the 
singularities of the Green’s functions. 


2. Random walk on a regular language 


2.1. Definitions and Hypotheses. A (nearest-neighbor) random walk on a 
regular language (abbreviated RWRL) is a Markov chain X,, on the set A* := 
Un>o0A” of all finite words from a finite alphabet A whose transitions obey the 
following rules: 


(a) Only the last two letters of the current word may be modified. 

(b) At most one letter may be adjoined or deleted. 

(c) Adjunction or deletion may only be done at the end of a word. 

(d) Probabilities of modification, deletion, and/or adjunction depend only on the 
last two letters of the current word. 


More precisely, a RWRL is a Markov chain whose transition probabilities satisfy: 
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(2.1) P( Xian = 2122... O | Xn = £122... 000) = pla'b' lab) 
P(Xn41 = T122... Ema’ | Xp = T122... 1mab) = p(a' jab) 
P(Xn41 = T122... £ma'b'c| Xn = 1112 ...Lmab) = p(a'b'cjab) 


P(Xn41 = a0 | Xn = a) = p(a'b'|a) 
P(Xn41 =a’ | Xn = a) = p(a'la) 
P(Xn41 = Í| Xn = a) = p(Ola) 
P(Xn41 = a| Xn = 9) = p(alO) and 
P(Xn+1 = 0| Xn = 0) = p010). 
Here Ø denotes the empty word. We do not assume that the transition probabili- 
ties are all positive, nor do we assume that there exist positive-probability paths 


connecting any two words. We shall, however, restrict attention to those random 
walks satisfying the following aperiodicity and irreducibility hypotheses: 


ASSUMPTION 2.1 (Aperiodicity). For all words £122... Em, 
(2.2) P(Xn41 = T122... Lm | Xn Spito) >0 
Observe that, in view of the homogeneity restrictions implicit in equations (2.1), this 
is equivalent to assuming that all of the transition probabilities p(ab| ab), p(a |a), 
and p(ý |Ø) are positive. 

ASSUMPTION 2.2 (Irreducibility-A). Let £L C A* be the set of all words that 
may be reached from Ọ via positive-probability paths. Then Yw € L there is a 
positive-probability path from w to 0. 


ASSUMPTION 2.3 (Irreducibility-B). If 
W1 = T1Z2 ...Lma E L and 
We = T1T2... Em E L 
then there exists a positive-probability path from wı to w2 through words of length 
at least m+ 1. 
ASSUMPTION 2.4 (Irreducibility-C). If 


W1 = 11XQ...Lma1dQ...an E L and 
wə = bibə . . bK Y1Y2 - - -Ym E L 

then there exists a word w3 E€ L of the form 
wg = b1bo... bk’ Z122 . . . Zm” Q102 . . . Qk- 


Moreover, for each such triple w1, w2, w3 there exists a positive-probability path from 
we to wa through words of length > k'. 

Henceforth, it is assumed that a RWRL satisfies the aperiodicity and irre- 
ducibility assumptions above. A number of interesting examples are discussed in 
section 3 below. It should be clear that our results may be extended to RWRLs 
satisfying somewhat weaker hypotheses. For instance, the hypothesis that changes 
of state should involve only the last two letters could be replaced by the weaker 
hypothesis that such changes involve only the last K letters, for some K < oo: see 
the discussion in paragraph 3.3 below. The aperiodicity hypothesis (2.2) is mainly 
for ease of exposition — random walks on regular languages that do not satisfy 
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this hypothesis may have transition probabilities that oscillate periodically, with 
any integer period. See section 6 below for an example. Finally, the irreducibil- 
ity assumptions 2.2-2.4 may be weakened; however, the assumptions hold in many 
interesting examples, including those discussed in section 3 below. 

It is easily proved that the set £ of words accessible from the root 0 is a regular 
language, that is, a subset of A* consisting of all words accepted by a finite-state 
automaton. See [12] for background on regular languages. Since our results and 
arguments will make no appeal to results from the theory of formal languages, we 
shall not discuss this point in any further detail; however, we note in passing that it 
is essentially the recursive structure of £ that lies behind the system of polynomial 
equations (5.9) below, on which our analysis is based. 


2.2. Local Limit Theorem. The main results of this paper are the following 
theorems, which concern the asymptotic behavior of the transition probabilities 
P”{ Xn = y} := P(X, = y| Xo = 2). 

THEOREM 2.5. Let Xn be a null recurrent RWRL. Then for each pair x,y € L 
there exists a constant 0 < Cry < œ such that as n —> oo, 


(2.4) PP{X, = y} ~ Cyn? 


THEOREM 2.6. Let Xn be a transient RWRL. Then for each pair x,y € L there 
exists a constant 0 < Cry < œ and 1 < R < œ (R is the spectral radius of the 
random walk) such that one of the following holds as n —> oo: 


(2.5) P”{Xn = y} ~ CyR™; or 
(2.6) P%{Xn = yY} ~ Cry Rn"; or 
(2.7) PPIX = yY} ~ Cry RonF”, 


The proofs of these theorems for the special case where x and y are words of 
length < 1 will be carried out in sections 5, 7, and 8 below. The general case, 
where x,y are arbitrary, may be deduced from this special case by a “recoding” 
trick — see paragraph 3.3 below. It will be shown in section 6 that all three cases 
of Theorem 2.6 may occur. It is known (see, for instance, [14] for the proof) that 
for random walk on a nonamenable group the power laws (2.5) and (2.6) cannot 
occur. Thus, for the examples considered in sections 3.5 and 3.6 below, only (2.7) 
is possible. 

It is somewhat mysterious that only the three possibilities (2.5)-(2.7) can occur. 
Although we have no purely probabilistic explanation, we believe that the latter 
two power laws correspond to different possible limit laws for the “pinned” random 
walks. Consider the renormalized wordlength process 


(2.8) Wn (t) = |X in |/ Vn, 
where |- | denotes wordlength and |-| denotes greatest integer. We conjecture that 


the power laws (2.6) and (2.7) correspond, respectively, to the following possibilities 
for the processes {Wn (t) }o<t<1: 


(2.9) W(t) 2, Brownian bridge, and 
(2.10) W,(t) 2, Brownian excursion. 


These are, of course, the only possibilities for the reflected random walks discussed 
in sections 3.1 and 6 below. Additional evidence for the conjecture is given in 
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a recent paper of BOUGEROL & JEULIN [4], where it is shown that the renormal- 
ized distance-to-the-identity processes for isotropic random walks on the hyperbolic 
plane must converge to the Brownian excursion. 


2.3. Malyshev’s Theorem. Random walks on regular language have previ- 
ously been studied by MALYSHEV and co-authors in [18] and [9], in which they are 
called “random strings”. Malyshev proves the following basic theorem: 


THEOREM . The wordlength Ln := |Xn| of a RWRL satisfies a SLLN: 
(2.11) Ln/n—> pu almost surely as n — oo, 
and u = 0 if and only if Xn is recurrent. In all cases, there is “edge stabilization”: 
for any m > 1, the joint distribution of the last m letters of Xn converges weakly 
as n > o. 


Theorems of this type, of course, give no information about the detailed as- 
ymptotic behavior of the transition probabilities. 


3. Examples of random walks on regular languages 


3.1. Nearest-neighbor reflecting random walks on Z4}. The most basic 
example is the simple nearest-neighbor p — q random walk with reflection at the 
the origin. This is the Markov chain on Z, with transition probabilities 


(3.1) P(Xny1 =2+1|Xn = x) 
P(Xnq1 = 4 —-1|X, =£)=1-p if x > 1; 
PX =1|Xn = 0) = 
PX =0|Xn=0)=1-r 


r 


Henceforth, we shall refer to this random walk as RRW (p; r). This is clearly equiv- 
alent to a random walk on the regular language £ = {a }m>0 with a one-letter 
alphabet. Notice, though, that this random walk does not satisfy the aperiodic- 
ity hypothesis (2.2) and in fact, as we will show in section 6 below, the transition 
probabilities do not obey the local limit theorem (2.7), but rather exhibit a periodic 
oscillation. 


3.2. Non-nearest-neighbor reflecting random walks on Z4}. Consider a 
Markov chain on Z, with transition probabilities 
(3.2) P(Xny1 =£ +k|X, = 2) = pp for |k| < K anda > K; 
P(Xn41 = £ + k| Xn = £) = Dek for 0<xz< Kand —x<k<K. 


Such a process behaves as a (homogeneous) random walk with increments bounded 
by K outside a finite neighborhood {0,1,--- , K — 1} of the origin. It is not difficult 
to see that a Markov chain governed by transition probabilities (3.2) is equivalent 
to a random walk on the regular language 


L= {a"b|m € Z} and b € {0,1,2,---,K —1}}. 


The word a™b corresponds to the integer mK +b. Since the increments in Xn 
are of magnitude < K, only the last two letters of the representing word aa--- ab 
need be modified to account for any possible change of state. Local limit theorems 
for Markov chains with transition probabilities (3.2) were proved, by Wiener-Hopf 
methods, in [15]. 
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3.3. The “recoding” trick. The hypothesis that transitions in a RWRL 
involve only the last two letters of the word representing the current state may 
seem at first glance to be quite restrictive. However, the trick used in the preceding 
paragraph to represent non-nearest neighbor reflecting random walks on Z} as 
RWRLs may be adapted to various other Markov chains. 

Suppose, for instance, that X, is a Markov chain on A* whose transition prob- 
abilities obey the restrictions (a)-(d) of section 2.1, except that in any given transi- 
tion, the last K letters (instead of the last 2) may be modified, and the probabilities 
of adjunction, deletion, and modification depend on the last K letters (not just the 
last 2). Although the transition probabilities of the Markov chain X,, do not obey 
the rules (2.1) (unless K = 2), it is not difficult to construct a RWRL Y, that “repre- 
sents” Xn as follows: Replace the alphabet A by the larger alphabet B := UK_, A™ 
consisting of all words of length < K from A. For each finite word zx in the letters 
A, let y = y(x) be the word in the letters B obtained from x by grouping letters of 
x in blocks of K, starting at the beginning of the word. For example, if K = 3: 


T = a1020304 =>> 
y(x) = (a1a2a3)(a4). 


Observe that the mapping x +> y(x) is bijective. It is routine to verify that the 
transition probabilities of the Markov chain Y,, = y(Xn) obey the rules (2.1). 

Notice that the recoding trick just described also produces a RWRL when the 
input process X,, is already a RWRL. In this case, the recoding collapses words of 
length < K to words of length 1. Therefore, to prove Theorems 2.5-2.6 for arbitrary 
x,y, it suffices to prove them for words x,y of length 0 or 1. 


3.4. LIFO Queues with multiple job types. A LIFO (last-in first-out) 
queue, also called a stack, is a data structure used in many fundamental algorithms 
of computer science (see, for instance, [5], section 11). The queue is stored as a 
linked list. New items are adjoined at the top of the linked list (“push”); and when 
items are removed, they are removed, one at a time, from the top (“pop”). In 
certain algorithms (for instance, when the stack is used to implement a recursion), 
the items in the list represent jobs to be completed, one at a time. These jobs may 
be of different types. Assume that the set A of possible job types is finite. 

Now suppose that inputs to the algorithm are random. Then during the ex- 
ecution of the algorithm the stack will evolve as a stochastic process whose state 
space is a subset of the set A* of finite words over the alphabet A. In algorithms 
where the state of the computation depends only on the top job in the stack (as 
is often the case) and inputs to the algorithm are i.i.d., the sequence of successive 
states of the stack will constitute a random walk on a regular language. 


3.5. Random walk on a finitely generated free group. The free group 
G = G(B) generated by a finite set B = {b1,b2,...,b2} consists of all reduced 
finite words in the letters B* = {bf', by',...,b¢'}. (A word is said to be reduced 
if no letter b51 is adjacent to its inverse bF.) Multiplication in G consists of 
concatenation followed by reduction; the empty word @ is the group identity. A 
nearest-neighbor random walk on G is a Markov chain Xn that evolves according to 
the rule 


(3.3) Xn+1 — Xnén+1 
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where €1,&,... are iid. random variables valued in B® U {Ø}. Clearly, post- 
multiplication by an element of B can modify only the last letter of a word in G, 
and so the transition probabilities implied by (3.3) must satisfy the hypotheses (2.1). 
If each element a € B* U {Ø} is assigned positive probability by the distribution 
of €,, then the aperiodicity and irreducibility hypotheses of section 2.1 will be 
satisfied. 

More generally, consider a non-nearest-neighbor random walk on G whose in- 
crements are bounded. The law of evolution is again given by (3.3), but now it 
is assumed that the jumps £n are i.i.d. random variables valued in the set of all 
reduced words of length < K. These random walks may also be represented as ran- 
dom walks on a regular language, using the recoding trick of paragraph 3.3 above. 
The appropriate regular language £ consists of all finite words a,a2...amb, with 
m > 0, such that (i) each a; € G is a word of length K (in the generators of the 
group G); (ii) b € G is a word of length 0 < |b| < K — 1; and (iii) no cancellations 
between successive a;,a;4+1 (or between am, b) are possible. It is easily verified that 
if the distribution of €; attaches positive probability to each word of length < K 
then the aperiodicity and irreducibility hypotheses of section 2.1 are satisfied. 

Finally, observe that if the transition probabilities are modified at finitely many 
elements of G, then the resulting Markov chain, a perturbed random walk, although 
no longer a “random walk” (in the usual sense of the term), is nevertheless still a 
random walk on a regular language. If the modifications are such that positive- 
probability transitions remain positive-probability transitions, then the aperiodicity 
and irreducibility hypotheses will still hold. However, whereas the power laws (2.5) 
and (2.6) cannot occur for processes of the form (3.3), they can occur for perturbed 
random walks. 


3.6. Random walk on the modular group PSL(2,Z). The modular group 
is the quotient group PSL(2,Z) = SL(2,Z)/{+I} where SL(2,Z) is the (multi- 
plicative) group of 2 x 2 matrices with integer entries and determinant 1, and T is 
the 2 x 2 identity matrix. (For information about the modular group and its con- 
nections with the theory of elliptic modular forms, and also for specific facts used 
in the subsequent discussion, see [16], especially Chapter XI.) A (right) random 
walk on PS'L(2,Z) is a sequence 


(3.4) Xn = 162... En 


where &1,€,... are iid. random variables valued in PSL(2, Z). We shall assume 
that (a) the distribution of €; has finite support; (b) P{€: = I} > 0; and (c) 
that for every element A € PSL(2,Z), the random walk {Xn }n>0 visits A with 
positive probability. We shall show that any such random walk has a description 
as a RWRL. The assumptions (b)-(c) above guarantee that the irreducibility and 
aperiodicity assumptions 2.1, 2.2, and 2.3 will be satisfied. 

The modular group is finitely generated, with generators 


f1 ee eee ee ee ie 
(3.5) ae ns U =, oh and T =T =f, i 


(More accurately, the group is generated by the equivalence classes (T) := {+T} 
and (U) := {+U}. Henceforth, we shall not distinguish between matrices and their 
equivalence classes in PSL(2,Z).) These generators satisfy the relations 


(3.6) T? = (TU)? =I. 
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Thus, every element of PSL(2,Z) may be written as a finite word in the letters 
T,U,U—' in which no two T’s appear consecutively, and no U is adjacent to a UTH. 
It follows that the modular group is also generated by T,W,W7?, where W = TU. 
The relations (3.6) translate as T? = W° = I. Every element M € PSL(2,Z) has 
a representation as a finite word 


(3.7) M=T°W™TW™@TW"s ..- TW T? 


where a,b = 0 or 1 and each n; is either 1 or 2. 

The modular group contains free normal subgroups of finite index, and so it 
has a representation as a regular language. Following is a sketch of how such a 
representation may be obtained (see [16], section XI.3E for details). Let I’ be the 
commutator subgroup of PSL(2,Z), that is, the subgroup consisting of all finite 
products of commutators. (The commutators of any group are the elements of the 
form aba~'b~+.) The commutator subgroup of any finite or countable group is a 
normal subgroup, and so I” is a normal subgroup of PSL(2, Z). The group I” is 
free (see [16], Theorem XI.3E) , with generators 


(3.8) X =TWTW’, X`! = WTW°T, 
Y =TW°TW, and Y`! = W°TWT. 


The index of IY in PSL(2, Z) (that is, the number of distinct cosets of I’) is 6. Thus, 
since I” is a normal subgroup, one may choose 6 distinct elements A1, Ag,..., A6 
of PSL(2,Z), one from each coset, such that every element of PS'L(2,Z) has a 
representation 


(3.9) M =WA,, 


where W is a reduced word in the (free) generators X, X~!, Y, Y~+. Observe that 
there may be some cancellation of letters at the end of W; but since there are only 
6 possibilities for A;, there are only finitely many possible types of cancellation, in- 
volving at most a bounded number of letters at the end of W. Thus, there is a finite 
subset B C PSL(2, Z) such that every element of PSL(2,Z) has a representation 
as a reduced word 


(3.10) M = W'B;, 


where W’ is a reduced word in X, X~!, Y, Y~! and B; € B. This exhibits PSL(2, Z) 
as a regular language £, with alphabet A = {X,X~!,Y,Y~!} UB. 

Now consider a right random walk (3.4) whose jumps &; satisfy the assump- 
tions (a), (b), and (c) above. By (a), the distribution of €; has finite support; 
consequently, there are only finitely many different ways that multiplication by 
€n41 can modify the end of the word Xn. In particular, there is a finite integer K 
such that all positive-probability transitions X, — Xn+1ı involve only the last K 
letters of Xn, and such that the transition probabilities depend only on the last K 
letters of the current state Xn. By replacing the regular language £ by a regular 
language £L’ whose alphabet consists of the words of length < K in £, one obtains 
a representation of the random walk X, as a random walk on a regular language 
whose transition probabilities obey the restrictions (2.1) of section 2.1. 

A local limit theorem for random walk on the modular group was proved by 
Woess [22], using different and somewhat more difficult arguments than those em- 
ployed below. Later [23], Woess also proved that, for any nearest-neighbor random 
walk on a discrete group containing a free normal subgroup of finite index, the 
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Green’s function is algebraic; however, his arguments do not allow one to deduce 
the type of the leading singularity. 


4. Darboux’ Theorem 


The asymptotic behavior of the transition probabilities P*(X,, = y) is governed 
by the behavior of the generating function )>,, 2” P*(X,, = y) near its smallest pos- 
itive singularity. To extract the information about the transition probabilities from 
the behavior of this generating function, we shall appeal to a Tauberian theorem 
of DARBOUX [2]. 

DARBOUX’ THEOREM . Let G(z) = XZ? o anz” be a power series with radius 
of convergence R. Suppose that G has no singularities in the closed disk |z| < R 
except z = R, and that in a neighborhood of z = R, 


(4.1) G(z) = A(z)(1 — z/R)* + B(z) 
where A(z) and B(z) are analytic near z = R, A(R) £ 0, and a € R\{0, —1, —2,...}. 


Then as n > oo, 


A(R) 


4.2 ea ee 
a 2 T(—a)nita R” 


More generally, if G has finitely many singularities ¢1, ¢2,...,¢m on the circle |z| = 
R of convergence, near each of which it has a representation 


(4.3) G(z) = Ai(z)(1 — 2/G)™ + Biz) 


with A;(z) and B;(z) analytic near z = ¢; and a; € R \ {0,—1, —2,...}, then as 
n— œ, 


(4.4) anv” Fi Ai(Gi) 


Paint ce 


provided the right side is not 0. 

The hypotheses of Darboux’ theorem are tailored to power series that define 
algebraic functions. A complex function G(z) defined for z € Q, where Q is a 
nonempty open subset of C, is said to be algebraic (more precisely, a branch of an 
algebraic function) if there is an irreducible polynomial P(z,w) in two variables 
such that 


(4.5) P(z,G(z)) =0 VzEn 


Observe that if the polynomial relation (4.5) holds in Q, then it holds for any 
analytic continuation of G to a domain containing 2, by the Law of Permanence 
of functional equations. See [11] for a thorough discussion of algebraic functions, 
including a proof of the following fundamental result: 

PROPOSITION . If G(z) is an algebraic function, then it has only finitely many 
singularities in the (extended) complex plane, and near each singularity has the 
form (4.3), with rational exponent a. 

If G(z) = $p oan2” is an algebraic function with a singularity at z = R 
and no other singularities in |z| < R, then the conclusion (4.2) holds provided the 
function A(z) in the representation (4.1) is not zero at z = R. Rather than verify 
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this directly, we shall find it more convenient to work with leading term of the 
Puiseux series of G at z = R. The Puiseux series (see [11]) is 


(4.6) G(z) = G(R) + D ba (R -— 2), 


where k > 1 is an integer; the series „>; bnz” has positive radius of convergence. 
Using the Darboux theorem, one may easily check that the asymptotic behavior of 
the coefficients an is determined by the exponent of the first (nonconstant) nonzero 
term in the Puiseux expansion. Thus, we have the following simplification in the 
hypotheses of Darboux’ theorem: 

COLLARY 4.1. Let G(z) = Epo an2” be a power series with nonnegative co- 
efficients an and radius of convergence R. Suppose that the function G(z) has no 
singularities in the closed disk |z| < R except z = R, and that as z — R— (through 
real arguments less than R) either 


(4.7) G(z) ~ C(1— z/R)* or 
(4.8) G(R) — G(z) ~ C(1— z/R)°, 


with C > 0. If G(z) is an algebraic function, then relation (4.2) holds with A(R) 
replaced by C. 


5. Green’s functions and related generating functions 


5.1. The Green’s Functions. Assume now that X, is a random walk on 
a regular language £ satisfying the aperiodicity and irreducibility hypotheses of 
section 2.1. For x,y € £, define the Green’s function Gzy(z) as follows: 


co [oe 
(5.1) Gay(2) := PX, Hye = E” (> 2”1{X, = ») 
n=0 n=0 
Since the coefficients in the power series are all probabilities, the series converge 
absolutely and uniformly in the closed unit disc |z| < 1. Thus, the power series 
of each of the Green’s functions has radius of convergence at least 1. In section 
7.2 below, we shall prove that all of these power series have the same radius R of 
convergence. Since their coefficients are nonnegative, and, by Assumption 2.2, at 
least some are strictly positive, the following must hold: 
(a) Gry(z) and its derivatives are positive for 0 < z < R. 
(b) Gzy(z) is strictly increasing and strictly convex in the interval [0, R). 
(c) lim, r- Goy(z) = Gry(R) € (0, oo]. 
(d) Gzy(z) has a singularity at z = R. 
Property (d) is PRINGSHEIM’s theorem (see [11]). 
The Green’s functions G,,(z) are intimately related to, and in fact completely 
determined by, a finite system of generating functions Hap,-(z) indexed by words 
ab and c of lengths 1 and 2, respectively. These are defined as follows: 


(5.2) Hav celz) = EV 2? {Xp = ch = > a P®{T =n and Xn = c} 


n=1 


where 


(5.3) T :=min{n : |X,| < |Xol}. 
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Observe that some of the generating functions Ha»,-(z) may be identically zero, as 
there may be no positive-probability paths from ab to c through words of length > 2. 
However, Assumptions 2.2-2.4 imply that at least some of the functions Hap,<(z) 
are not identically zero. We shall refer to these as the nondegenerate first-passage 
generating functions. In section 7.1 below we will show that the nondegenerate 
first-passage generating functions Hap,-(z) have a common radius p of convergence, 
and in section 7.3 that p > R. Because the coefficients in the power series are 
nonnegative, and, for the nondegenerate generating functions, not all zero, the 
nondegenerate generating functions Hab e(z) satisfy properties (a)-(d) above, with 
R replaced by p. 

The relationship between the Green’s functions and the first-passage generating 
functions Hab e(z) is a simple consequence of the Markov property. Observe that the 
second equality in (5.1) shows that the Green’s function G,,(z) may be interpreted 
as the expected discounted number of visits to y when the initial state is x. Let x 
and y be words of lengths |x|, |y] < 1. If Xo = x, then Xı must be a word of length 
0,1, or 2; if X; is a word of length 2, then there can be no more visits to y until 
after the Markov chain X,, returns from the set of words of length > 2. Hence, by 
the Markov property, if |x| < 1 and |y| < 1, then 


(5.4) Goy(z) = d2(y) +2 XO p(wla)Guy(z)+ 25° X p(ablx) Hap,c(z)Gey(z), 
w:|w|<1 ab c 


where J`, is over all two-letter words, and }°, is over all one-letter words. 

The equations (5.4) may be written compactly in matrix form as follows. Define 
G(z) to be the matrix-valued function of z with entries G,,(z), indexed by words 
x,y of lengths < 1, and define P to be the matrix of transition probabilities p(y|z), 
where again x and y are words of length < 1. Define K(z) to be the matrix-valued 
function of z with entries K,,(z), indexed by words x,y of length < 1, defined by 


(5.5) Koy(2) =) plablz)Hasy(2) if lul = 1; 
ab 


=0 if |y| = 0. 
Then equations (5.4) are equivalent to the matrix equations 
(5.6) G(z) = I+ zPG(z) + zK(z)G(z) => 
I= (I -zP —- zK(z)) G(z) 
where I is the identity matrix. Notice that for all z of sufficiently small modulus |z], 


the matrix (zP+zK(z)) has norm less than one, and so the matrix (I— zP — zK(z)) 
is invertible; thus, 


(5.7) G(z) = (I — zP — 2K(z))* 


for all z in a disc centered at z = 0. 


5.2. Algebraic relations among the generating functions. Equations 
(5.6) above show that the generating functions G,, and H,,,, are algebraically 
interrelated. In this section, we will show that the auxiliary generating functions 
Hy are themselves algebraically interrelated via a system of quadratic equations 
that define an algebraic curve. This will imply that these functions are algebraic. 
This, together with the equations (5.6) above, will imply that the Green’s functions 
Gyy(z) are algebraic functions of z. 
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Recall (see equations 2.1)) that, for a random walk on a regular language, only 
the last two letters of the current word play a role in the transition to the next word. 
Consequently, for any initial state € = x,x%2...%mab E L, the letters £1£2... Em 
are ignored up to time T. It follows that for any word € = 71%2...%,,ab € L, and 
any c € A, 


(5.8) Hab e(z) = ESzT1{Xr = £182... EmC}. 


The homogeneity relations (5.8) lead directly to a system of quadratic equations 
relating the functions Hab,- These equations derive from the Markov property: 
Starting from any word w, the random walk must make an initial jump to another 
state. If this first jump is to a word whose length is less than that of the initial 
word w, then T = 1; otherwise, the initial jump must be either to a word w’ of the 
same length, or to a word w” with one additional letter. In the latter case, T is 
the first time after the initial jump that two letters are removed; in the former, T 
is the first time that a single letter is removed. Summing over all possible initial 
jumps and applying the identity (5.8) yields the following system of equations: 


(5.9) Hav,c(z) = zp(clab) + zp(ablab) Hav,c(z) +z 5 p(de|ab)Hae, e(z) 
d,e€ A 


+z S° pldeflab) X Hep.g(2)Hag,c(2)s 


d,e,fEA gEA 


We shall abbreviate this system of equations as 


(5.10) H(z) = 2Q(H(z)) 


where H(z) is the vector of nondegenerate first-passage generating functions Hap,<(z) 
(that is, those that are not identically zero), and Q is the vector of quadratic poly- 
nomials Qab, in the variables Hab, occurring on the right sides of the equations 
(5.9) above. Observe that Q is quadratic (that is, each of its components is qua- 
dratic) and does not depend on the variable z. Moreover, the coefficients in the 
polynomials Qab, are positive. This will be of crucial importance later. 


5.3. Algebraic curves and the algebraic character of the Green’s func- 
tions. An affine (algebraic) variety V C C” is defined to be the set of all zeros 
(@1,@2,.-., Qn of a finite set of polynomials in n variables over the field C of complex 
numbers. (See, for instance [6], Ch. I, for the definition. Some authors use a dif- 
ferent terminology, calling what we have defined to be an affine variety an algebraic 
set, and reserving the term variety for an irreducible algebraic set.) The system of 
equations (5.10) determines an affine variety Vp, since it consists of polynomial (in 
fact, quadratic) equations in the variables z and Hab,- Similarly, the system con- 
sisting of equations (5.10) and (5.6) determines a variety Vox. (Observe that Vi is 
obtained by projecting the variety Vag onto the z and H,,,. coordinates.) Notice 
that the point (0, H(0)) lies on the variety Vz, and that the point (0, H(0), G(0)) 
lies on Vou. 

A basic theorem of algebraic geometry (see [6], section 4.6) states that any 
affine variety V has a decomposition V = Ut_,V; into irreducible affine varieties, 
where a variety W is termed irreducible if the only decompositions W = U; U U2 of 
W into two varieties are such that W = U or W = U3. 
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PROPOSITION 5.1. There exists an open neighborhood N of the point 
(0, H(0), G(0)) 


such that N N Vor is contained in an irreducible algebraic curve (a variety of 
dimension 1). 


PROOF. See [6], Chapter 9, for the definition of the (algebraic) dimension of 
an affine variety. For our purposes, it will only be important that the algebraic 
dimension coincides with the transcendence degree (see the proof of Corollary 5.2 
below) of the field of rational functions over C. The system of equations consisting 
of (5.10) and (5.6) has the form 


(5.11) F(z,w) = 0, 
where w = (w1, W2,.-..,Wm) and F is an m— vector of polynomials in the variables 
z and w1, W2,..., Wm. If (z.,w.) is a solution of the equation (5.11), then it is said 
to be a nonsingular point if the Jacobian matrix 

OF 
(5.12) J:= 

Ow 


is nonsingular at (z, w) = (z.,w.). If (24, w.) is a nonsingular point of the variety 
Vr defined by the system (5.11), then, by Theorem 9.6.9 of [6], it lies on a unique 
irreducible component of the variety Ve of dimension 1. For the system consisting 
of equations (5.10) and (5.6), the function F may be written F = (Fy, Fa), where 
F(z, H) = H — zQ(H) and 
Fe(z,H, G) = (I — zP — zK(z)) G(z). 


The Jacobian matrix at (0,H(0),G(0)) has the form 
I O 
v=(. 4) 


where J denotes the identity matrix and O the matrix all of whose entries are 0. 
This is certainly nonsingular, so the point (0, H(0), G(0)) is a nonsingular point of 
Vex, and it lies on a unique irreducible component of dimension 1. 


Remark. Observe that the condition (5.12) is also the hypothesis of the (complex) 
Implicit Function Theorem (see, for example, [13], Chapter 3). This theorem states 
that, if the Jacobian matrix J is nonsingular at (z.,w.) then, for each z in a 
neighborhood of z,, there is a unique solution w = w(z) of (5.11), and the mapping 
z ++ w(z) is analytic. In particular, the locus of solutions to (5.11) near (z,,w.) is 
analytically parametrized by a single parameter z; in other words, it is a complex 
manifold of (topological) dimension one. The theorem 9.6.9 of [6] cited in the proof 
above states, in essence, that the topological and algebraic dimensions of the variety 
coincide at nonsingular points. 


COLLARY 5.2. Each of the functions Gry and Hap,< occurring as entries of the 
Green’s function matriz G(z) and the first-passage generating function vector H(z) 
is algebraic. 
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PROOF. Let V be the irreducible component the variety Vg containing the 
point (0,H(0),G(0)). Each of the coordinate variables z, Hay, and Gzy is a 
member of the field C[V] of rational functions on the variety V (see [6]), section 
5.5). By Theorem 9.5.2 of [6], the transcendence degree of the field C[V] (that is, the 
maximal number of algebraically independent elements) coincides with the algebraic 
dimension of V, which, by Proposition 5.1, is 1. Thus, there is a polynomial relation 
between any two elements of C[V], in particular, between z and Hab,c, and between 
z and Gry. 


In principle, a polynomial P(z, w) relating z and any one of the Green’s func- 
tions can be obtained by an “elimination” algorithm, starting from the system (5.9) 
and (5.6). See [6] for an elementary discussion of elimination theory. Unfortunately, 
in all but the very simplest cases, the system (5.9) is either too large for the com- 
putation to be carried out, or leads to a polynomial P of too high a degree to be 
solved by quadratures for Gyy(z) in terms of z. There is, however, one interesting 
case where the system may be solved explicitly — this is discussed in the next 
section. 


6. A Special Case: Reflecting Random Walk on Z4, 


This special case is instructive, in that explicit forulas for the generating func- 
tions may be obtained, and all of the possibilities enumerated in Theorems 2.5 
and 2.6 are exhibited. However, the results of this section will play no role in the 
subsequent sections. 


6.1. Explicit expressions for the Green’s functions. The Green’s func- 
tions of the random walks RRW (p; r) defined in section 3.1 may be written in closed 
form as algebraic functions of their arguments. Let X,, be the Markov chain on the 
set of nonnegative integers with transition probabilities given by (3.1). Define the 
generating functions G(z), F(z) by 


(6.1) G(z) = 5 z” P”{X„n=0} and 
n=0 
(6.2) F(z) = b's" where 
(6.3) T =min{n: Xn = 0} 
By (5.4) and the Markov property, (see [7], section XIV.4, for similar calculations), 
(6.4) G(z) =(1-—2(1—r)-—2rF(z))"* and 
(6.5) F(z) = (1 — p)z + pzF(z). 


Notice that the algebraic system (5.10) reduces to the single quadratic equation 
(6.5). This equation has two solutions; since the function F(z) defined by (5.2) is 
nondecreasing in z for 0 < z < 1, the correct branch is 

_ 1— y1- 4p(1 = pe 

7 2pz ` 


(6.6) F(z) 


The smallest positive singularity is at z = p where 


(6.7) p = plp) = 1/(2vp(1 — p)). 
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This is a branch point of order two. Observe that —p is also a branch point of 
order two, but that there are no other singularities on the circle |z| = p. Notice 
also that the function H(¢) := /CF(./¢) has only one singularity on its circle of 
convergence, at Ç = (4p(1—p))~+, and so by Darboux’ theorem, as n — oo through 
the odd integers, 


(6.8) PHT =n} ~ Alp; r) n? 
for a suitable value A(p; r) € (0,00). 


6.2. The Case p = 1/2. The reflecting random walk RRW(1/2;r) is null 
recurrent for every value of r € (0,1). The equation (6.4) for G(z) simplifies to 


(6.9) G(z) = (1 —2(l-r)-r+rvy1 A), 


which shows that G has singularities at z = +1. These singularities are of different 
types, however, as G(z) — 0 as z — 1, while G(z) => 2—r #0 as z > —1. It is 
routine to check that 


(6.10) G(z) = A4) (1-271? + B}(z) nearz=1 and 
(6.11) G(z) = A-(2)(1- 2)! + B_(z) near z = —1 


for analytic functions A+, B+. Thus, by Darboux’ theorem, as n > oo the return 
probabilities satisfy 


(6.12) P°{X, =0} ~ = 


for a suitable constant C = C(1/2;r) > 0. 


6.3. The Case p > 1/2. When p > 1/2 the reflecting random walk RRW (p; r) 
is transient for every r € (0,1). The nature of the leading singularity of the Green’s 
function G(z), and its location vis a vis the singularity z = p(p) of the first-passage 
generating function F(z), depend on the value of r. Observe that if p > 1/2 then 
F(1) < 1, since the random walk is transient. In fact, by (6.6), 


(6.13) F(p(p)) = F0/(2Vp(1 — p))) = vV (1 — p)/p < 1, 
and so, by the strict monotonicity of F(z) on the positive real axis, F(z) < 1 for 
all z < p(p). 

Define 

l-p 

6.14 eS) abe 
(6.14) (») =1- 75 

LEMMA 6.1. Let z = ¢ = C(p;r) be the smallest root of the equation 
(6.15) l=2(l—-r)+rzF(z) 


in the interval z € (0, p(p)], provided there is one. Then for each p > 1/2, the zero 
¢(p;r) is continuous and strictly increasing in r forO<r<r.(p). Atr=r.(p), 


(6.16) S(p: r) = p(y) = 1/(2V p01 — p)). 
For r > r,(p), there are no zeros of the equation (6.15) in the interval (0, p(p)]. 
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PROOF. Since F(1) < 1, the zero ¢(p;1r), if it exists, must be strictly greater 
than 1. Moreover, since F(z) is strictly increasing in (0, p], there is at most one zero 
in (0, p]. For each z € (0, p(p)], the function z(1 — r)+rzF(z) is strictly decreasing 
in r, since F(z) < F(p(p)) < 1 by (6.13). Consequently, the function ¢(p;1r) is 
strictly increasing in r. That ¢(p;r) is continuous in r follows from the implicit 
function theorem, as F’(z) > 0 for all z € (0, p(p)]. 

For all r such that (1 —r)~! < p(p), the function z > 1 — 2(1 — r) — rzF(z) 
must have a zero in the interval (1,(1—1r)~'), by the intermediate value theorem. 
Hence, ¢(p;7r) is defined for all small r > 0. By the preceding paragraph, the 
function ¢(p;1r) is strictly increasing and continuous in r up to the point rą where 
¢(p; rx) = p = p(p). This occurs at the solution r = r, of the linear equation 


1 = p(1 -= r) + prF(p). 
Using (6.6), it is easily checked that rą =7.(p) is given by (6.14). 


COLLARY 6.2. Assume that p > 1/2. Let rą = r.(p), ¢ = C(p;r), and p = 
p(p) be as in Lemma 6.1 and equation (6.7). Then for the reflecting random walk 
RRW (p;r) the transition probabilities satisfy 


(6.17) PX, =0} ~ CC” if 0< Tr < Ta; 
(6.18) P°{X, = 0} ~ Cp n? if è r=rů; 
(6.19) P°{X, =0} ~ (C+ (1) C) n3 if oan <r<1, 


for suitable constants C = C(p;r) > 0 and C! = C'(p;r) 4 0. 


PROOF. If r < r.(p) then by Lemma 6.1 the smallest positive zero ¢(p;1r) of the 
equation (6.15) is strictly less than the radius p(p) of convergence of F(z). Hence, 
by equation (6.4), the smallest positive singularity of G(z) is at ¢(p;7r) < p(p). Since 
F'(z) > 0 at z = C(p;r), and since F(z) is analytic in a neighborhood of ¢(p;r) it 
follows that G(z) has a simple pole at z = ¢(p;r). It is not difficult to show, using 
the explicit expressions (6.4) and (6.5), that G(z) has no other singularities on the 
circle |z| = ¢(p;r). The relation (6.17) follows. 

Consider next the case r > r,(p). Since p(1 —1r) + prF(p) is strictly decreasing 
in r (recall that F(p) < 1, by (6.13)), and takes the value 1 at r,(p), it follows that, 
for r > r.(p), 

1> p(l—r)+prF(p). 
Consequently, by equation (6.4), G(z) is an analytic function of z and F(z) ina 
neighborhood of any point (£, F'(€)) where || = p, and has power series expansion 


G(2) = GE) + GE)? (= 6) ns Pe) + Erl) — FE) o) 


Since F(z) has branch points of order two at z = +p, so does G(z); and since 
F(z) has no other singularities on the circle |z| = p, neither does G(z). Thus, by 
Darboux’ theorem, the asymptotic formula (6.19) holds. 

Finally, consider the case r = r,(p). In this case the Green’s function once 
again has singularities at z = +p, and at no other points on |z| = p; however, 
G(z) becomes infinite as z — p, but remains finite as z — —p. The nature of the 
singularity at z = +p may be deduced from the identity 


1/G(z) =1-2(1-1r) - zrF(z); 
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this implies that, in a neighborhood of z = p, 
G2) = A(2)(p— 2)-¥? + BY) 
for suitable analytic function A(z), B(z). Similarly, near z = —p the function G(z) 
satisfies 
G(z) = C(2)(p— 2)? + D(z) 
for suitable analytic function C(z), D(z). Thus, by Darboux’ theorem, (6.18) holds. 


Remark 1. It may strike the reader as somewhat strange that the period-two 
oscillation in the transition probabilities occurs only for r > r,(p). Here is an 
explanation: Conditional on the event X,, = 0, the distribution of the total number 
Nn of returns to 0 before time n satisfies 


Ny = 00 ifr <r,(p), but 
Nn = Op(1) if r > r,(p) 


as n — oo. Since the state 0 has a positive holding probability, the many returns 
to 0 destroy any aperiodicity when r < r.(p). 


Remark 2. The random walks considered in this section do not obey the hypothe- 
ses of Theorems 2.5-2.6, and so by themselves do not prove that all the different 
cases of these theorems actually occur. However, it is a routine (if tedious) matter 
to obtain explicit formulas for the Green’s functions of modified reflecting random 
walks with holding probability a > 0 for all states x > 1, and to deduce from 
these expressions local limit theorems similar to those obtained above. The only 
difference in the results is that the period-two oscillation in formula (6.19) does not 
occur. 


7. Analysis of the leading singularity 


We return now to the study of random walks on regular languages in general. 
Recall from section 5 that the matrix G(z) of Green’s functions Gz,(z) is a ratio- 
nal function (5.6) of the generating functions Hap,-(z), and that these generating 
functions are interrelated by the system (5.10) of quadratic equations (5.9). As ear- 
lier, we denote by H the vector of nondegenerate first-passage generating functions 
Hqp,c(Z). 


7.1. Radius of convergence of the system (5.10). In this paragraph we 
will show that all of the nondegenerate first-passage generating functions Hap,<(z) 
(that is, all those that occur as entries of the vector-valued function H(z)) have the 
same radius of convergence, and that all have the same singularity type at z = p. 
In general, two algebraic functions y(z) and ¢(z), both with singularities at z = Ç, 
will be said to have the same singularity type at the point z = ¢ if (a) either y(z) 
and ~(z) are both finite at z = Ç, or are both infinite; and (b) the leading terms in 
the Puiseux expansions of y(z) and (z) at z = ¢ have the same exponent a > 0, 
that is, 


plz) ~x C-z)” and = o(z)~C"(G-2z)% or 
P(C) — plz) ~ C(¢ — z)* and (C) -= ylz) ~ C (C — z)* 
for suitable constants C, C” Æ 0. 
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PROPOSITION 7.1. All of the generating functions Hav e(z) occurring as entries 
of H(z) have the same radius p of convergence, and all have the same singularity 
type at z = p. 

The proof will rely on an irreducibility property of the system (5.10) of qua- 
dratic equations governing the generating functions Hab e(z). The system (5.10) 
asserts that the function H(z) is a fixed point of the mapping F +> zQ(F); conse- 
quently, H(z) is a fixed point of any iterate of this mapping. In particular, 


(7.1) H(z) = (zQ) 0 (zQ) o--- o (zQ)(H(z)) := z™Q/™(H(z)), 


where o denotes functional composition. The following lemma concerning this hier- 
archy of polynomial equations is the key point in the analysis where the Irreducibil- 
ity Assumptions 2.3 and 2.4 are used. 


LEMMA 7.2. There exists an integer m > 1 such that, in each component of 
(zQ)°™(H), each variable Harc appears in a term with positive coefficient. 


PROOF. In each component equation (5.9) of the vector equation (5.10), the 
right side contains a term that is z times a constant p(ab|ab) times the left side. 
Consequently, when equation (5.10) is substituted in the mth iterate to obtain the 
(m + 1)—th iterate, any product of the variables Hab, that occurred as a term in 
the mth iterate will remain as a term in the (m+ 1)—th iterate, albeit with a higher 
power of z. Therefore, to prove the lemma it suffices to show that, for each pair 
Hav c, Harv œ of components of H, there exists m > 1 such that Hay. appears as 
a factor in some term of the Hay,-—component of the m—th iterate of (5.10). 

Recall that if Hab e(z), Harv’ œ (z) are entries of H(z) then they are not identi- 
cally zero, equivalently, there exist positive-probability paths from ab to c and from 
a’! to c through words of length > 2. Let 


ab > w, > w2 -> Wn > cC 


be such a path from ab to c. By Assumption 2.4, there is a positive-probability 
path from wn to a word w,wa’d’ through the set of words prefaced by wn. By the 
homogeneity property (5.8), 


Erva Tif Xr = wnwe } = Hawe (z) #0. 


By Assumption 2.3, for each consecutive pair 8, y of letters in the word we’, there 
is a positive-probability path from 8y to @ through words prefaced by 3. Conse- 
quently, for sufficiently large m, the equation for Ha»,-(z) in the mth iterate of the 
system (5.10) contains a term with the factor Harv (z). 


Proof of Proposition 7.1. That all of the generating functions H(z) have the 
same radius of convergence follows immediately from Lemma 7.2, because, since 
H(1) > 0 and A’(z) > 0 for z > 0 and for each function H, Lemma 7.2 implies 
that for any pair H, H, there are constants C, C” > 0 such that 


(7.4) H(z) > CH, (z) and H,(z) > C'H(z) 


for all z > 1. It also follows from this that either all of the values H(p) are +co, 
or all are finite (and, of course, positive). In the case where all are +00, the 
inequalities (7.4) imply that the leading terms in the Puiseux expansions all have 
the same exponent a. 
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Suppose that all of the functions H are finite (and positive) at their common 
radius of convergence p > 1. By Lemma 7.2, for each integer k > 1 there exist 
constants Ck, C}, > 0 such that, for any pair H, H,, the kth derivatives satisfy 
(7.5) HO (A) > CHP and =H) (z) > CLH®(z). 

It follows that the leading terms in the Puiseux expansions must all have the same 


exponent a. 


Henceforth, we shall refer to p as the “radius of convergence” of the system 
(5.10). 


LEMMA 7.3. Let Hap,e(z) be nondegenerate, and let 0 < r < p be such that 
Hapc(r) < co. Then for any complex number Ç £ r such that |¢| =r, 


(7.6) |Hab,c(¢)| < Hav,elr) 
ProoF. This is a consequence of Assumption 2.1, which implies that, for some 
€ > 0, p(ablab) > £. This, in turn, implies that, for each integer n > 1, 
(7.7) PT =n + Xs = 0} SEP = n; Xn = c}. 
Consider any two two consecutive terms of the power series expansion of Hap,<(¢), 
with an := P%®{T =n; Xn = c}: 
lang” T Ce Ginza = r” lan T An+16| 
<r" lan + an+ır| 


= anr” + anyir". 


This shows that the sum of any two consecutive nonzero terms in the power series 
expansion of Hqp,-(¢) is strictly less, in modulus, than the sum of the corresponding 
terms in the power series expansion of Hap e(r). 


7.2. Radius of convergence of the Green’s functions G,,(z). 


PROPOSITION 7.4. All of the Green’s functions Gyy(z) have the same radius of 
convergence R > 1. Moreover, all of the Green’s functions Gyy(z) have the same 
singularity type at z = R. 


PROOF. The argument follows similar lines as the proof of Proposition 7.1, but 
here Lemma 7.2 is replaced by a “Harnack”-type inequality. By Assumption 2.2, for 
any two elements u, v of the regular language £, there are positive probability paths 
from u to v and from v to u. Consequently, for any two pairs (x, y),(x’,y’) € £? 
there exist k > 0 and £ > 0 such that 


(7.8) P*{Xntn = y} > EP” {Xn = y'}. 


This inequality clearly implies that the Green’s functions G,,(z) and Gzy (z) have 
the same radius of convergence R, and that if Gay (R) = œ then Gz,(R) = oo. It 
follows that the Green’s functions G,,(z) are either all finite or all infinite at their 
common radius of convergence R. 

Because each G',,(z) is an algebraic function, it has a Puiseux expansion at 
z= R. Since G,,(z) is strictly increasing in z along the line segment [0, R], the 
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first two terms of the Puiseux series must be of the form 
(7.9) Gzyl R) — Gry(z) ~ Cry(R — 2 if Gey (R) < 00; or 
(7.10) Gay(z) ~ Cry( R — 2) 47 if Gry(R) = œ, 
where Q@z, > 0 and Czy > 0. Consider first the case where all the Green’s functions 
are infinite at z = R: By inequality (7.8) above, for any two pairs (x, y), (2’, y’) € £, 
and any a > 0, there exists a constant £ > 0 such that 
(R -= z)%°Gry(2) > E(R — 2) Gurl), 
Consequently, in case (7.10), all of the exponents a, must be the same. 

Now consider the case (7.9). Once again, by inequality (7.8), for any two pairs 
(x,y), (x',y') € £, and any a > 0, there exist constants €% > 0 such that the kth 
derivatives of the Green’s functions satisfy 

(R= z)*G®)(z) > eR =a a e. 


TY 


This implies that in case (7.9) all of the exponents a,, are the same. 


7.3. The Perron-Frobenius matrices M(z) = (P+ K(z)). It is not neces- 
sarily the case that R = p: as we have seen in section 6, it is possible that R < p, 
and also that R = p. Later we will show that R > p is impossible. Recall equation 
(5.6), according to which 


(7.13) I = (I- zP — zK(z)) G(z). 


This implies that if z is such that the matrix (I — zP — zK(z)) is invertible, and 
if K is analytic in a neighborhood of z, then the entries Gz, of G are finite and 
analytic near z, because (5.7) implies that they may be expressed as convergent 
Neumann series in the functions Kry. Consequently, if z = ¢ is a singularity of one 
of the Green’s functions Gz,(z) then either z = ¢ is a singularity of H(z) or the 
matrix (I — zP — zK(z)) is noninvertible. The following result will be of crucial 
importance in investigating the invertibility/noninvertibility of this matrix. 

LEMMA 7.5. For 0 < z < p the matric M(z) := P + zK(z) is a Perron- 
Frobenius matriz. 


PROOF. Recall that a Perron-Frobenius matrix is a square matrix M such that 
some positive power M” of M has strictly positive entries. For z > 0, the matrices 
P and K(z) have entries 


(7.14) Pry := p(ylz), 
(7.15) Kay(2) = X p(ab|x) Hasy(2), 
a,b 


where p(-|-) are the transition probabilities in (2.1) and Hab, e(z) is defined by (5.2). 
The indices x,y are words in £ of length 0 or 1. For any such pair x,y of indices, 
there is a positive-probability path from x to y, by Assumption 2.2. If there is 
a positive-probability path of length n > 1 through words of length < 1, then 
certainly P% > 0; and since p(w|w) > 0 for all elements of £, by Assumption 2.1, 
it then follows that Pre k > 0 for all k > 0. If there is not a positive-probability path 
from x to y through words of length < 1, then there must be a positive-probability 
probability path from x to y that visits a word of length 2. Such a path visits a 
sequence of words of length < 1, then a word w of length 2, then, after wandering 
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among the words of length > 2, another sequence of words of length 1, etc., and 
finally arrives at y: 


T —> L1 >> LQ +++ Lk, SWI 


Tkı+1 Tkı+2 S Tko w2 


Tkə+1 
—> Tk, Wr 
>Tm = y 
For each pair w = w;,v = £k;+1 in this sequence, it must be the case that Hy,,»(z) > 
0 for z > 0, because there is a positive-probability path from w to v through words 
8 
of length > 2. Moreover, for each pair 7;,2;41 not straddling a w;, it must be the 
case that p(xi+ı|zi) > 0. Consequently, for all z > 0, 
(P+ K(z))f7" > 0 => 
(P+K(2)) "tE > 0 Vk>0. 


It now follows that, for all sufficiently large n > 1, the matrix (P + K(z))” has 
strictly positive entries. 


Because M(z) := (P + K(z)) is a Perron-Frobenius matrix for all 0 < z < p, 
the conclusions of the Perron-Frobenius theorem (see, for example, [21]) apply: In 
particular, M(z) has a positive eigenvalue u = u, = p(z) of multiplicity one, and 
the rest of its spectrum is contained in the open disc |¢| < u+. Moreover, there are 
unique right and left eigenvectors a = a, = a(z) and 8 = B, = B(z), with strictly 
positive entries, normalized so that the following system of equations hold: 


(7.16) M(2)6(z) = w(z)8(z); B(z)F a(z)= 1; 
a(z)"M(z) = w(zja(z)?; a)l = 1, 


where, in the last equation, 1 denotes the vector with all entries 1 (thus, the equa- 
tion asserts that a(z) is a probability vector for each z € (0, p)). 


LEMMA 7.6. The functions u(z), a(z), and B(z) extend to analytic functions 
in a neighborhood of the line segment (0, p), and for z € (0,p), the function p(z) 
is nondecreasing in z. Furthermore, if all of the first-passage generating functions 
Hay.c(z) are finite at z = p, then p(z), a(z), and B(z) are continuous on the closed 
interval z € (0, p]. 


PROOF. The entries of M(z), being linear combinations of the first-passage 
generating functions Hab e(z), are analytic functions of z for |z| < p. Moreover, if 
all of the functions Hab, e(z) are finite at z = p, then as z — p— the entries of M (z) 
converge monotonically to those of M(R). Consequently, the analyticity and con- 
tinuity of u(z), a(z), and 8(z) follow from standard results in regular perturbation 
theory (see, for example, [19], section XII.1), because the lead eigenvalue a(z) has 
multiplicity one at every z € (0, p). That the eigenvalue ju(z) is nondecreasing in z 
for 0 < z < p follows because the entries of M(z) are nondecreasing in z. 


LEMMA 7.7. zu(z) <1 for all z € [0, RA p). 
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PROOF. Suppose not; then there exists Ç € (0, R A p) such that u(¢) = 1. By 
Lemma 7.6, the function zu(z) is strictly increasing in z along the line segment 
(0, p), and so zu(z) < 1 for all z € [0,¢). Because zu(z) is the spectral radius 
of zM(z), it follows that the matrix (I — zM/(z)) is invertible for all z € (0,¢). 
Consequently, the equation (7.13) is solvable for G(z) along (0, Ç), via the equation 
(5.7), and so, by the Law of Permanence of functional equations, the relation (7.13) 
holds for all z in a neighborhood of the segment [0,¢). Taking the inner products 
with the left eigenvector a(z)? and the vector 1 now gives 


(7.17) 1 = (1 — zp(z))a(z)? G(z)1, 


valid for all z € [(0,¢). Now ¢ < RA p, by hypothesis, and so each of the functions 
u(z) a(z), and G(z) is continuous at z = ¢. Thus, relation (7.17) persists at 
z = Ç. But Cu(¢) = 1, and so equation (7.17) cannot possibly hold at z = Ç unless 
G(¢) = œ, which would imply that ¢ = R. 


LEMMA 7.8. For each r € (0, p) and each C £ r of modulus |¢| = r, the spectral 
radius of M(¢) is strictly less than u(r). 


PROOF. The entries of M(z) = (P + K(z)) are given by equations (7.14) and 
(7.15). The nonzero entries of K(z) are positive linear combinations of the functions 
Hap,c(z). By Lemma 7.3, these satisfy the inequalities 


|Hab,c(¢)| < Hat,c(r). 


Consequently, the nonzero entries of M(¢) are strictly less, in modulus, than the 
corresponding entries of M(r). Therefore, the spectral radius of M(Ç) is strictly 
less than that of M(r). 


PROPOSITION 7.9. If R< p then z = R is a simple pole of each Gzy(z). 


PROOF. Recall that R, being the common radius of convergence of the Green’s 
functions G,,(z), is a singularity of G(z). Recall also that the functions Hp,-(z) are 
analytic in the disc |z| < p, which contains the closed disc |z| < R. Hence, if R < p, 
then, by equation (7.13), the matrix (I — M(z)) must be noninvertible at z = R, 
because if it were invertible then the function G(z) would extend analytically to a 
neighborhood of z = R, which it does not. 

By the preceding lemma, the Perron-Frobenius eigenvalue pu(z) of M(z) is 
strictly less than 1/z for all z € (0,R), and, by Lemma 7.6, it is continuous in 
z for 0 < z < p. Since u(z) is the spectral radius of M(z), if Ru(R) < 1 then the 
matrix (I — RM(R)) would be invertible, which it is not. Therefore, if R < p then 


(7.19) Ryu(R) = 1. 
Now consider the function F(z) := a(z)?G(z)1. This is a linear combination 


of the entries G,,(z) of the Green’s matrix G(z). These entries are multiplied by 
entries of the left eigenvector a(z)”, which are strictly positive for 0 < z < p and 
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analytic in z. By equation (5.6), for all z € (0, R), 
(7.20) F(z) = a(z)?G(z)1 

= a(z)71+4 za(z)? M(z)G(z)1 

= 14+ 2p(2)F(2) 

= (1 = zp(2)) 7}, 
which shows that F(z) has a simple pole at z = R, since zu(z) > las z > R 
and p(z) is nondecreasing in z. Since F(z) is a positive linear combination of the 
entries of the Green’s function matrix G(z), and since all of these entries have the 


same singularity type at z = R (by Proposition 7.4) it follows that that each of 
these entries has a simple pole at z = R. 


COLLARY 7.10. If R< p then for each pair x,y E L of words of length 0 or 1, 
there exists a constant Cy, > 0 such that, as n —> oo, 


(7.21) PPX = y} ~ Cry R”; 


ProoFr. By Proposition 7.9, if R < p then G(z) has a simple pole at z = R. 
By Lemma 7.8, if ¢ 4 R and |¢| = R then the spectral radius of M(C)is strictly less 
than that of M(R), which, by equation (7.19), is 1/R. Consequently, for each such 
Ç, the matrix (I — ¢M(¢)) is invertible. Since the entries of M (z) are analytic in 
|z| < p, it follows by the implicit function theorem that, for each ¢ # R such that 
\¢| = R, the Green’s function matrix G(z) is analytic in a neighborhood of z = ¢. 
Thus, G(z) has no other singularities in the closed disc |z| < R. It follows that the 
coefficients P”{X„ = y} of Gzy(z) must obey the asymptotic law (7.21) for some 
nonzero constant Cry. Since the coefficients are probabilities, the constant must be 
positive. 


PROPOSITION 7.11. If R £ p, then R= p. In this case, for each entry Hab c of 
H, 


(7.22) Havel p) < œ. 


PROOF. By Lemma 7.7, if R > p then zu(z) < 1 for all z € [0, p). It follows that 
the entries of the matrix zM (z) remain bounded as z — p—. Since the entries of 
M(z) are positive linear combinations of the probabilities p(y|a) and the functions 
Hg e(z) it follows that at least some of the functions Hg e(z) remain bounded as 
z = p—. Proposition 7.1 implies that all of the functions Hg e(z) remain bounded 
as z > p—. 


7.4. The Perron-Frobenius matrix of the system (5.10). Having estab- 
lished the local limit theorem (7.21) in the case R < p, we now turn our attention 
to the case 


(7.23) R=p. 

By Proposition 7.9, if (7.23) holds then all of the entries Hap,-(z) of H(z) are finite 
at z = p. Since all of the generating functions Hab e(z) occurring as entries of 
H have singularities at z = p, where p is the common radius of convergence, the 
point z = p must be a branch point of the algebraic curve (5.10). In particular, 
by the implicit function theorem, z = p must be the smallest positive real number 
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where the linearization of the system (5.10) becomes singular. In vector form, the 
system (5.10) reads H(z) = zQ(H(z)); the linearization is obtained by taking the 
differential of each side: 


dH = (dz)Q(H) + (3) (dH) <= 
(7.25) (x 2 aR) (dH) = (dz)Q(H) 


The analysis of the linear system (7.25) is similar to the analysis of the linear 
system (7.13) in paragraph 7.3 above. Once again, a crucial role is played by a 
one-parameter family of matrices which are Perron-Frobenius matrices along the 
line segment (0, p]. These are defined as follows: 


(7.26) Bae a 


LEMMA 7.12. For each z € (0, p], the matriz L, is a Perron-Frobenius matriz. 


PROOF. Since the components of Q(H) are polynomials in the variables Hab, 
with positive coefficients, so are the entries of the matrix L, = 0Q/0H, as are the 
entries of any positive power L? of L,. Since those generating functions Hap,<(z) 
that occur as entries of the vector H(z) are strictly positive for 0 < z < p, it follows 
that for each z € (0, p] and m > 1, the entries of L? are nonnegative. It remains 
to be shown that, for sufficiently large m, these entries are nonzero. 

Let H = Hab and H, = Haw œ be entries of H, and, for each m > 1, let 
Q (H) be the H—component of the vector polynomial Q°”(H). We will show 
that the (H, H,.)—entry of the matrix L?” is nonzero if and only if the variable H, 
occurs as a factor in a term of Q97"(H).The proof is by induction on m. When 
m = 1, the statement is certainly true, because L, = 0Q/0H. Assume, then that 
the claim is true for all positive integers < m. Consider the (H, H.)—entry of the 
matrix L”*!; this will be nonzero if and only if, for some entry H,, of H, the 
(A, H,...)—entry of L? and the (H.+, H,)—entry of L, are both nonzero. By the 
induction hypothesis, this occurs if and only if H,, occurs as a factor in a term of 
Qr (H) and H, occurs as a factor in a term of Qa (H). But this occurs if and only 
if H, occurs as a factor in a term of Qi") (A), because the vector polynomial 
Q°("+) (H) is obtained by substituting Q(H) for H in Q°”(H). 

Now recall Lemma 7.2, according to which there exists m > 1 such that, in 
each component of Q°™”, every entry H of H occurs as a factor in a term with 
positive coefficient. In view of the result of the previous paragraph, it follows that, 
for any such m, every entry of L” is nonzero. 


Remark. The argument used in the second paragraph of the proof may be modified 
to prove a somewhat stronger statement. If m > 1 is so large that every component 
of Q°”(H) contains every H = Hab, as a factor in a nonzero term, then, by 
resubstitution, it follows that every component of Q°?”"(H) contains the square of 
every H = Hab, as a factor in a nonzero term. Consequently, every entry of the 
matrix L2” contains every H = Hab, as a factor in a nonzero term. 


Because L, is a Perron-Frobenius matrix when 0 < z < p, it is governed by 
the conclusions of the Perron-Frobenius theorem. Thus, it has a positive eigenvalue 
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A = Az = A(z) of multiplicity one, and the rest of its spectrum is contained in 
the open disc |¢| < Az. Moreover, there are unique right and left eigenvectors 
w = w, = w(z) and v = v; = v(z), with strictly positive entries, satisfying 


(7.27) L(z)v(z) = X(z)v(z); v(z)? w(z)= 1; 
w(2)TL(z) = Aaywl2)™ waT =i, 


where, in the last equation, 1 denotes the vector with all entries 1. Since the 
matrix L(z) depends analytically on z for |z| < p, and since the multiplicity of the 
eigenvalue A(z) is 1 at every z € (0,p], the functions A(z), w(z), and v(z) are all 
analytic functions of z for z in some open neighborhood of the line segment (0, p] 
(see [19], section XII.1). Since the entries of the matrices L, are nondecreasing in 
z, for 0 < z < p, the Perron-Frobenius eigenvalue A(z) is also nondecreasing in z. 


LEMMA 7.13. A(p) = 1/p. 


PROOF. Suppose that pA(p) < 1. Since the spectrum of pL, lies entirely in 
the disc |¢] < pA(p), the matrix I — pL, would be invertible. Consequently, the 
linearization (7.25) of the algebraic system (5.10)) at z = p would be solvable for 
dH: 

dH = dz (I — zL.)~' zQ(H)). 
But this would imply, by the implicit function theorem, that H(z) admits an ana- 
lytic continuation to a neighborhood of z = p, which is impossible, since z = p is a 
singularity of H(z). 

Now suppose that pA(p) > 1. Since z + zX(z) is analytic, and therefore 
continuous, along the line segment (0, p| and takes the value 0 at z = 0, it would 
follow, by the intermediate value theorem, that rA(r) = 1 for some r € (0, p]. But 
this would imply that the matrix (I — rL,) is singular, and that the system (7.25) 
cannot be solved for dH in terms of dz at z = r. This, in turn, would imply that 
z = r is a singularity of the function H(z), a contradiction, since z = p is the 
smallest positive singularity. 


COLLARY 7.14. If the RWRL Xn is transient, then p > 1. 


ProoF. If the RWRL is transient, then at least one of the first-passage gener- 
ating functions Hab e(z) must satisfy 


(7.29) Hanell) <1. 


LEMMA 7.15. For each z Æ p in the disc |z| < p, the spectral radius of the 
matrix L, is strictly less than 1/p. 


PROOF. It suffices to show that, for some integer m > 1, the spectral radius of 
L?” is less than 1/p™. Recall that the entries of the matrix L? are positive linear 
combinations of products of the variables z and Hap,-. Moreover, if m is sufficiently 
large, then each entry contains every Hab, as a factor in some term with positive 
coefficient (see the remark following the proof of Lemma 7.12). Consequently, by 
Lemma 7.3, if z # p but |z| = p then all of the entries of L? are strictly less 
in modulus than the corresponding entries of Lr’. It now follows by a routine 
argument that the spectral radius of L?’ is strictly less than that of Lf’, which, by 
Lemma 7.13, is 1/p™. 
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LEMMA 7.16. None of the functions H(z) occurring as entries of H have sin- 
gularities on the circle |z| = p other than those at z = p. 


PROOF. By Lemma 7.15, if z 4 p but |z| = p then the spectral radius of zL, is 
strictly less than 1, and so the matrix I — zL, is invertible. Thus, the linearization 
(7.25) of the system (5.10) is solvable for dH in terms of dz at z. By the implicit 
function theorem, H admits an analytic continuation to a neighborhood of z, and 
so z is a regular point of H. 


It remains to determine the nature of the singularity at z = p. By Corollary 
5.2, each of the generating functions Hab e(z) is algebraic; by Proposition 7.1, that 
all of the nondegenerate Hap e(z) have the same singularity type at z = p; and by 
Proposition 7.11, if pọ = R then all of the functions H,,(z) are finite at z = p. 
Therefore, there exist constants a > 0 and Cy > 0 such that for each nondegenerate 
H = Hab,c, 

(7.30) H(p) — H(z) ~ Cu(p -= 2)“. 

PROPOSITION 7.17. a = 1/2. 


PROOF. Consider once again the system H(z) = zQ(H(z)) of quadratic equa- 
tions (5.9) for the vector H(z) of nondegenerate first-passage generating functions 
Hab e(z). These equations are valid for all z in the disc |z| < p, and, by continuity, 
for all z on the circle of convergence |z| = p. Taking differences, we have 


(7.31) H(p) — H(z) = pQ(H(p)) — zQ(A(z)) 
= p(Q(H(e)) — Q(H(z))) + (e — 2)Q(H(p)) 
— (p— 2)(Q(A(p)) - Q(A(2))). 
Writing AH := H(p) — H(z) and Az := p — z and collecting terms, we obtain 
(7.32) (1— pL,) AH = (Az)Q(H(p)) 
— p(quadratic terms in AH) 
— (Az)(linear and quadratic terms in AH). 


Since the quadratic polynomials Q(H) have nonnegative coefficients, so do the 
quadratic terms on the right side of (7.32), and at least some of these coefficients 
are positive. Now take the inner product with the left eigenvector v, of the Perron- 
Frobenius matrix L,: since pA(p) = 1, by Lemma 7.13, this gives 


(7.33) (Az) (v, Q(H(p))) =p(quadratic form in AH) 
+ (Az)(linear and quadratic terms in AH). 


Since the eigenvector vp has strictly positive entries, the coefficient v7 Q(H(p)) of 
Az on the left side is positive. Moreover, the coefficients in the quadratic form are 
nonnegative, and at least some of them are positive; hence, the quadratic form is 
nonnegative definite and nondegenerate. This implies that the exponent in (7.30) 
must be a = 1/2. 


COLLARY 7.18. Assume that R = p. Then there exists C, > 0 such that as 
z => R-, 


(7.34) H(R) — (2) ~ C,(R- 2)". 
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PROOF. When R = p, the first-passage generating functions Hab, e(z) are finite 
at z = R, by Proposition 7.11. Consequently, the entries of M(z) are finite at 
z = R, because they are positive linear combinations of the functions Hap,-(z). 
Since the matrix M(R) is a Perron-Frobenius matrix, its leading eigenvalue has 
multiplicity one, and so, by regular perturbation theory [19], the leading eigenvalue 
is an analytic function of the matrix entries in a neighborhood of M(R). Thus, in 
particular, the function u(z) is given by a convergent power series in the entries of 
M(R) — M(z), and the linear terms of this power series are not all zero (because 
locally, near M(R), the spectral radius must increase linearly with any entry). The 
relation (7.34) therefore follows from Proposition 7.17. 


7.5. Behavior of the Green’s functions at R = p. When R = p, all of 
the first-passage generating functions Hay,-(z) are finite at z = R, by Proposition 
7.11, have Puiseux series of the form (7.30) at R, and have no other singularities 
on the circle |z| = R, by Lemma 7.16. By Lemma 7.7, for every z € (0, R) the 
spectral radius zp(z) of zM(z) := z(P + K(z)) is strictly less than one, and so by 
the continuity of u(z), either 
(7.35) Ryu(R) <1 or 
(7.36) Ru(R)=1 
The special cases considered in section 6 show that both possibilities may occur. 

PROPOSITION 7.19. For each entry Gy,(z) of the Green’s function matrix G(z), 
there exists a constant Cry > 0 such that, as z => R-, 

(7.37) Gay(z) ~ Cry(R—z)-¥? if Ru(R) = 1; and 
(7.38) Gay (R) — Gry(2) ~ Cry(R = 2)! if Ru(R) <1. 

PROOF. By Corollary 5.2, all of the Green’s functions Gz,(z) are algebraic; 
and by Proposition 7.4, all have the same singularity type at their common radius 
R of convergence. Consequently, all have convergent Puiseux expansions at z = R. 
It must be shown that the leading terms of these expansions are as implied by 
(7.37)-(7.38). 

Since all of the leading terms have the same exponent, by Proposition 7.4, it 
will suffice to consider a convenient positive linear combination of the entries of 
G(z). Thus, consider the function F(z) = a(z)?G(z)1, where a(z)” is the left 
Perron-Frobenius eigenvector of M(z) and 1 is the vector of all 1s. By equation 


(7.20), 
F(z) = (1 — zu(2))1. 
Hence, by Corollary 7.18, 
F(z) ~ Cr(R- 2)? if Ru(R) = 1; and 
F(R) — F(z) ~ Cp(R- z2)? if Ru(R) <1. 
Since lim,_,r_ a(z) = a(R), it follows that 
a(R)TG(z)L ~ Cr(R—2z)7/? if Ru(R) = 1; and 
a(R)? G(R)1 — a(R)7G(z)1 ~ Cr(R- 2)! if Ru(R) <1. 
The relations (7.38)-(7.37) clearly follow. 


zZ 
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COLLARY 7.20. For any two words x,y of lengths |x|,|y| < 1, there exists a 
constant Cry > 0 such that, as n —> œ, 


(7.40) PY LX, = y} ~ Cry R” if R< p; 
(7.41) PX = y} ~ Cory R n"? if R= p and Ru(R)= 1; and 
(7.42) P°{Xn =y} ~ CryR n? if R=p and Ru(R) <1. 


PROooF. The first statement has been proved in Corollary 7.10. In view of 
Corollary 5.2 and Proposition 7.19, the latter two results are immediate conse- 
quences of Corollary 4.1. 


8. Transience and the spectral radius 


By Corollary 7.20, the only possible limiting behaviors of the transition prob- 
abilities are (7.40), (7.41), and (7.42). Thus, to complete the proofs of Theorems 
2.5-2.6, we must show that R = 1 if and only if the the RWRL is recurrent, because 

(a) if R = 1, then (7.40) implies that the transition probabilities converge to 
nonzero limits, and so the RWRL is positive recurrent; 
(b) if R = 1, then (7.41) implies that the transition probabilities converge to 
zero, and so the RWRL is null recurrent; 
(c) if R = 1, then (7.42) implies that the transition probabilities are summable 
in n, and so the RWRL is transient, by a standard test for recurrence; and 
(d) if the RWRL is recurrent, then the Green’s functions are all infinite at 
z = 1, by the same test for recurrence, and so R = 1. 

PROPOSITION 8.1. If the RWRL is transient, then R > 1. 

It is possible to prove this in a purely “analytical” fashion, using Lemma 7.13 to 
show that p = 1 is impossible unless all the nondegenerate first-passage generating 
functions Hab e(z) take the value 1 at z = 1, which implies that the RWRL is re- 
current. However, it is simpler to use a probabilistic argument based on a standard 
large-deviations estimate, which we record in the following lemma. 


LEMMA 8.2. Let 1, £2,... be a sequence of Bernoulli random variables adapted 
to a filtration {Fn}n>0. Assume that there exists p > 0 such that, for every n > 1, 


(8.1) P(En41 = 1 || Fn) 2 p. 
Then for every a < p there exist 6 < 1 and C < œ such that Ym > 1, 


(8.2) PL Ej < an for some n > m} < Cp”. 


j=1 


PROOF. See [8]. 


Proof of Proposition 8.1. Ifthe RWRL Xn is transient, then for every k > 1, with 
probability one, the first k letters of Xn eventually stabilize. Consequently, there 
exists a word w € £ such that 


(8.3) P”{Xn is prefixed by w Vn > 0} := p> 0. 
Define £(w) to be the set of all words v € £ that are suffixed by w, i.e., L(w) is 


the set of all words in £ of the form uw. By Assumption 2.4, for any word w’ € L, 
there exists a positive-probability path y from w’ to a word v € L(w) through 
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words of length > |w’|. Because only the last two letters in any transition influence 
the transition probabilities, the probability of such a path y depends on w’ only 
through its last two letters. Since there are only finitely many two-letter words in 
L, it follows that, for some a > 0 and some positive integer N, 


(8.4) P”{Xn E€ L(w) for somen < N} >aVve L. 


Consider the event {Xn = Ø}. In order that this event occur, it must be the 
case that, for every visit to the set L(w) prior to time n, the last letter of the suffix 
w is erased, or one of the last two letters modified, before time n. By (8.4) and 
Lemma 8.2, the probability that there are fewer than an/2N visits to L(w) by time 
n decays exponentially in n. By (8.3) and Lemma 8.2, the probability that one of 
the the last two letter of the suffix w is modified an/2N times before time n decays 
exponentially in n. Therefore, P®{X,, = Í} decays exponentially in n, equivalently, 
R>1. 


Concluding Remark. While this paper was being reviewed, the author re- 
ceived a preprint of a paper by T. NAGNIBEDA and W. WOESS in which results 
similar to Theorems 1 and 2 are proved. The results of Nagnibeda and Woess are 
proved only for random walks on trees, but by a similar method. 
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